Textpresso is a text-mining system for scientific
literature. Textpresso's two major elements are (1) access to full text, so
that entire articles can be searched, and (2) introduction of categories of
biological concepts and classes that relate two objects (e.g.,
association, regulation, etc.) or describe one (e.g., methods, etc).
A search engine enables the user to search for
one or a combination of these categories and/or keywords within an entire
literature.
Textpresso is useful as a search engine for researchers as well as
a curation tool. It was developed as a part of WormBase and is used extensively by C. elegans curators.
Textpresso has currently been implemented for 17
different literatures, and can readily be extended to other
corpora of text.
News and updates
July 11th, 2008: A new server! All Textpresso
sites are now hosted by a new server, and the software has been updated.
Please contact us if you find sites and files missing
or not working.
Software available for
creating a new corpus - alere 0.1.0